Picture for Federico Tombari

Federico Tombari

PARCEL: Pool-Anchored Resampling with Conditioned Elastic Queries for Efficient Vision-Language Understanding

Add code
May 28, 2026
Viaarxiv icon

SA4Depth: Consistent Pose-Depth Scale Alignment for Self-Supervised Monocular Depth Estimation

Add code
May 27, 2026
Viaarxiv icon

Good Token Hunting: A Hitchhiker's Guide to Token Selection for Visual Geometry Transformers

Add code
May 22, 2026
Viaarxiv icon

Stitched Value Model for Diffusion Alignment

Add code
May 19, 2026
Viaarxiv icon

OpenGaFF: Open-Vocabulary Gaussian Feature Field with Codebook Attention

Add code
May 07, 2026
Viaarxiv icon

SSL-R1: Self-Supervised Visual Reinforcement Post-Training for Multimodal Large Language Models

Add code
Apr 22, 2026
Viaarxiv icon

R-CoV: Region-Aware Chain-of-Verification for Alleviating Object Hallucinations in LVLMs

Add code
Apr 22, 2026
Viaarxiv icon

TAPNext++: What's Next for Tracking Any Point (TAP)?

Add code
Apr 12, 2026
Viaarxiv icon

Stepper: Stepwise Immersive Scene Generation with Multiview Panoramas

Add code
Mar 30, 2026
Viaarxiv icon

OVI-MAP:Open-Vocabulary Instance-Semantic Mapping

Add code
Mar 27, 2026
Viaarxiv icon